Finding Higher Order Motifs under the Levenshtein Measure

Authors

  • Ezekiel F. Adebiyi
  • Tinuke Dipe
Abstract

We study the problem of finding higher order motifs under the Levenshtein measure, otherwise known as the edit distance. In the problem set-up, we are given sequences, each of average length , over a finite alphabet and thresholds and , and we are to find composite motifs that contain motifs of length (these motifs occur with at most differences) in distinct sequences. Two interesting but involved algorithms for finding higher order motifs under the edit distance were presented by Marsan and Sagot [7]. Their second algorithm is much more complicated, and its complexity is asymptotically no better. Their first algorithm runs in , where , , is a concave function that is less than 1, and is the expected number of all monad motifs. We present an alternative algorithmic approach, also for the edit distance, based on the concept described in [3, 4]. The resulting algorithm is simpler and runs in expected time.
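
To make the occurrence condition in the abstract concrete, the following is a minimal sketch that checks whether a single candidate motif occurs, with at most a given number of edit operations, in at least a given number of distinct sequences. It is not the algorithm of the paper or of Marsan and Sagot; it is a brute-force check built on the standard dynamic program for approximate substring matching. The parameter names `max_diff` and `quorum` are hypothetical stand-ins for the thresholds, and the sketch handles only a single (monad) motif, not composite motifs.

```python
def min_edit_distance_to_substring(motif: str, text: str) -> int:
    """Smallest Levenshtein distance between `motif` and any substring of `text`."""
    # prev[j] holds the best distance between the motif prefix processed so far
    # and some substring of `text` that ends at position j.
    prev = [0] * (len(text) + 1)  # the empty prefix matches anywhere at cost 0
    for i in range(1, len(motif) + 1):
        # Against an empty substring, motif[:i] costs i deletions.
        curr = [i] + [0] * len(text)
        for j in range(1, len(text) + 1):
            cost = 0 if motif[i - 1] == text[j - 1] else 1
            curr[j] = min(
                prev[j - 1] + cost,  # match or substitution
                prev[j] + 1,         # motif character missing from the text
                curr[j - 1] + 1,     # extra character in the text
            )
        prev = curr
    return min(prev)  # the occurrence may end at any position


def occurs_in_enough_sequences(motif, sequences, max_diff, quorum):
    """True if `motif` has an occurrence within `max_diff` edits in >= `quorum` sequences.

    `max_diff` and `quorum` are hypothetical names for the problem's thresholds.
    """
    hits = sum(
        1 for seq in sequences
        if min_edit_distance_to_substring(motif, seq) <= max_diff
    )
    return hits >= quorum


if __name__ == "__main__":
    seqs = ["TTACGTTT", "TTAGGTTT", "CCCC"]
    print(occurs_in_enough_sequences("ACGT", seqs, max_diff=1, quorum=2))
```

Each check costs time proportional to the motif length times the sequence length, so enumerating all candidate motifs this way is far too slow for real inputs; the sketch only pins down what "occurs with at most a given number of differences" means.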

Similar resources

Extracting Common Motifs under the Levenshtein Measure: Theory and Experimentation

Using our techniques for extracting approximate non-tandem repeats [1] on well-constructed maximal models, we derive an algorithm to find common motifs of length P that occur in N sequences with at most D differences under the edit distance metric. We compare the effectiveness of our algorithm with the more involved algorithm of Sagot [17] for edit distance on some real sequences. Her method has ...

Structural Pattern Recognition for Industrial Machine Sounds Based on Frequency Spectrum Analysis

In order to discriminate different industrial machine sounds contaminated with perturbations (high noise, speech, etc.), a spectral analysis based on a structural pattern recognition technique is proposed. This approach consists of three steps: 1) to de-noise the machine sounds using the Morlet wavelet transform, 2) to calculate the frequency spectrums for these purified signals, and 3) to conv...

A Comparative Study of Glass-Working Motifs of Seljuks in Iran and Fatimids in Egypt during 12th-13th Centuries (A.D)

Glass-working was one of the dominant arts in the Islamic period. In the time of Iranian Seljuks and, contemporaneous with them, Egyptian Fatimids, this art was so innovative that such an era is considered as the glorious period in the history of Islamic glass-working in the two countries. In addition to contemporaneity of Seljuks and Fatimids, economic and cultural relations between the two re...

Improving Search and Exploration in Tag Spaces Using Automated Tag Clustering

In recent years we have experienced an increase in the usage of tags to describe resources. However, the free nature of tagging presents some challenges regarding the search and exploration of tag spaces. In order to deal with these challenges we propose the Semantic Tag Clustering Search (STCS) framework. The framework first groups syntactic variations using several measures based on the Leven...

Measuring Norwegian dialect distances using acoustic features

Computational dialectometry has been proven to be useful for finding dialect relationships and identifying dialect areas. The first to develop a method of measuring dialect distances was Jean Séguy, assisted and inspired by Henri Guiter (Chambers and Trudgill, 1998). Strongly related to the methodology of Séguy is the work of Goebl, although the basis of Goebl’s work was developed mainly in dep...

Publication date: 2003